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j^ ' Abstract 

This paper is devoted to testing time series that exhibit behavior re- 
lated to two or more regimes with diflerent statistical properties. Mo- 
tivation of our study are two real data sets from plasma physics with 
observable two-regimes structure. In this paper we develop estimation 
^1^ , procedure for critical point of division the structure change of a time se- 

^^ ■ ries. Moreover we propose three tests for recognition such specific behav- 

ior. The presented methodology is based on the empirical second moment 
and its main advantage is lack of the distribution assumption. Moreover, 
the examined statistical properties we express in the language of empirical 
i-^ I quantiles of the squared data therefore the methodology is an extension 

fvj . of the approach known from the literature [H 1171 [l] [13] . The theoretical 

results we confirm by simulations and analysis of real data of turbulent 
laboratory plasma. 

Keywords: statistical test, nonstationarity, two regimes, empirical sec- 
ond moment, quantile 
^ , PACS: 02.50.Tt, 02.50.Cw, 02.70.Uu 

1 Introduction 

The main issue in real data analysis is testing distribution. This problem ap- 
pears not only in case of independent identically distributed (i.i.d.) sample 
[221 Uni [2S] but also when we calibrate a model to real data set [H [T21 13] . In this 
case the distribution is fitted to the residual series that is assumed to be i.i.d. 
But many independent variables seem to display changes in the underlying data 
generating process over time [10] therefore they can not be considered as iden- 
tically distributed sample. This typical behavior we observe also in time series 
described in Section |2] that presents increments of floating potential fluctuations 



of turbulent laboratory plasma for the small torus radial position r = 37 cm. 
For this data set the known statistical tests for stationarity mentioned in Sec- 
tion 13] [71 ISl HU [TSl m] are not useful. What more, under some assumptions 
they indicate the data are i.i.d. that is in contradiction with behavior of the 
data observable in Figure [TJ 

Therefore in this paper we introduce three tests that can be useful to time 
series for which we observe more than one regime with different statistical prop- 
erties. Two of them are visual therefore we call them pre-tests and propose to 
use in the preliminary analysis to identify the specific behavior. In order to con- 
firm two or more regimes in the data set we have developed statistical test for 
regime variance. Moreover, we also introduce the estimation procedure for the 
critical point that divides examined time series into two parts with different sta- 
tistical properties. However, the inspection only of the data can lead sometimes 
to the wrong preliminary choice of the model therefore the mentioned tests are 
based on behavior of the empirical second moment of the examined time series. 
The advantage of the methodology based on the empirical second moments is 
emphasized in [3J |2S] and is also confirmed by the bottom panel of Figur^T] 
that presents the squared data for which the difference between two regimes is 
more visible. In the presented methodology we do not assume the distribution 
because the introduced tests exploit only the empirical properties of examined 
data set. What is more important, they can be used to data for which the 
point of division into two regimes is well-defined (is clearly observable), but also 
for data for which the point is not visible. Moreover, we show by simulation 
study that the proposed methodology can be also useful for infinite-variance 
time series. 

The rest of the paper is organized as follows: in Section [2] we present the ex- 
amined data sets that are motivation of developing the presented methodology. 
Next, in Section |3] we overview the known statistical tests for stationarity and 
present the estimation procedure of recognition the critical point introduced in 
[2i] . In Section H] we introduce two visual pre-tests that indicate at specific 
behavior of examined time series, i.e. two regimes related to different statis- 
tical properties. In this Section we propose also the innovative procedure of 
estimation for critical point based on the behavior of empirical second moment 
of real data set and present the simulation study. In Section [5] we introduce 
the statistical method for testing regime variance and test the procedure by us- 
ing simulated data. In the next Section we analyze real data sets form plasma 
physics in the context of presented methodology. Finally, the last Section gives 
a few concluding remarks. 

2 Motivation 

Motivation of our study is presented in Figure [1] real data set. This time se- 
ries describes increments of floating potential fluctuations (in volts) of turbulent 
laboratory plasma for the small torus radial position r = 37 cm. Precise de- 
scription of the experiment is presented in [11| . The signal was registered on 



15 June 2006 with movable probe in scrape-off layer (SOL) plasma of stellara- 
tor "URAGAN 3M". Because the signal was registered every 0.0000016 second 
therefore total length of time series is 30000, but to the analysis we only take 
1900 observations between 12000 — 13900. As we observe in Figure [T] the em- 
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Figure 1: The empirical time series from plasma physics that presents incre- 
ments of floating potential fluctuations (in volts) of turbulent laboratory plasma 
for the small torus radial position r = 37 cm (top panel) and squared time series 
(bottom panel). 



pirical data set exhibits very special behavior, namely the statistical properties 
of the time series change in time. It can be related to the fact that the first 
observations constitute random sample that comes from another distribution, 
than the last part or those two parts come from the same distribution with 
different parameters. Therefore we can suspect that the time series satisfies the 
following property: 



X, 



X for i < / 



Y for i > /, 



(1) 



where X and Y are independent and have different statistical properties and 
I is fixed point. As we have mentioned in Section [1] inspection of the data 
can lead sometimes to the wrong preliminary conclusions, therefore we propose 
to consider the squared time series. As we observe in Figure [1] the difference 
between two parts is more visible for squared data. The statistical properties 
we express in the language of quantiles of squared time series and we assume the 
random variables X"^ and Y^ in relation ([T]) have different quantiles 5^/2 and 
Qi-a/2 for given confidence level a. Here we take the notation qa as quantile of 
order a. 

After preliminary analysis of the data set and confirmation that it constitutes 
realizations of independent random variables (see Figure [TT]) we have tested hy- 



pothesis of the same distribution of time series. The known statistical tests such 
as Augmented Dickey-Fuller, Phillips-Perron or Kwiatkowski-Phillips-Schmidt- 
Shin test for stationarity reject the hypothesis that the data are nonstationary 
(in the sense presented in Section [3]) that suggests they are not useful for this 
data set. Therefore we propose three tests that can be used for data that exhibit 
similar behavior as this observed in Figure [U but also to this that after prelimi- 
nary analysis we can not reject the hypothesis about the same distribution. An 
example is shown in Figure [H This time series presents increments of floating 
potential fluctuations (in volts) of turbulent laboratory plasma for the small 
torus radial position r = 36 cm. Similar as for the first data set, the signal was 
registered on 15 June 2006 and total number of observations was 30000 but to 
illustration we take only observations from 12000 to 15000. After analysis of 
the time plot for series and squared series we can suspect that the data can not 
be considered as identically distributed sample but here the point, when some 
statistical properties change is not so visible as for the first data set. Moreover 
the mentioned tests for stationarity presented in Section |3] indicate that under 
some assumptions the time series can be considered as stationary process but 
in next Sections we will show that this hypothesis is not true. Moreover we will 
find such point that divides examined data into two i.i.d. samples. 
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Figure 2: The empirical time series from plasma physics that presents incre- 
ments of floating potential fluctuations (in volts) of turbulent laboratory plasma 
for the small torus radial position r = 36 cm (top panel) and squared time series 
(bottom panel). 



3 Statistical tests for stationarity 



In order to make any inferences about the structure of a time series we need some 
regularity over time in the behavior of the underlying series. This regularity one 



can formalize using a concept of stationarity, see [B]. We say that time series 
is weakly stationary if the mean of the series is constant over time and the 
covariance between observations on time t and s depends only on their absolute 
difference |s — t|. 

However stationarity is not a common feature of time series and mostly 
we observe nonstationary behavior of the process. There are several types of 
nonstationarity. The trend nonstationarity means that the data posses some 
deterministic trend (for example linear trend) but otherwise are stationary. This 
can be easily seen based on autocorrelation function (for instance linear trend 
can be seen as a linear slow in time decay of autocorrelation function) [5]. 
The second type of nonstationarity is called difference nonstationarity, which 
means the process has to be differenced in order to become stationary. This two 
types examples of nonstationarity are often encountered in real-life data. The 
class of unit-root tests help to distinguish difference from trend nonstationarity. 
Under the null hypothesis that the series is difference nonstationary one can 
mention here Dickey-Fuller unit root tests [71 |H1 [H] and Phillips-Perron unit 
root tests [H]. Testing in opposite direction, namely assuming that time series 
is trend stationary against it is difference one can apply the KPSS test due to 
Kwiatkowski, Phillips, Schmidt and Shin [T3]. 

Mentioned above types of nonstationarity can be successfully tested and 
recognized from the data but they are not the only problems one may encounters 
during data analysis. Atypical observations, level shifts or variance change are 
common features of many real- life data sets [101 El HI] • Neglecting such effects 
may lead to inaccurate estimation of parameters of the model and in consequence 
inaccurate or completely wrong prediction. In present work we will discuss the 
effect of variance change in the data sets, thus there is no trend and differenced 
data have the same behavior as initially before differentiation. Such specific 
two-regimes time series was also considered in [24], where the following model 
for the innovations (independent sample) was considered: 

£» Hi <l , , 

e,{l+ujv) iii>l. ^' 

for some point I, fixed number ujy and under the assumption {e^}"^]^ constitutes 
i.i.d. random variables from normal distribution. We can thus calculate the 
variance ratio of e[ before and after the structural change: 






^' - .: ::f:^a ^ (3) 



where {I — 1) and {n ^ I -\- 1) are greater than zero. The variance ratio is an 
estimate of (1 -I- ojvY ^'^d is likelihood ratio test statistics of variance change 
under the assumption of normality. The test is the most powerful for step change 
in variance when the point I is known. If the critical point is unknown one can 
apply variance ratio statistics to find it. In this case we compute the variance 
ratio statistics for stochastically independent series and obtain its minimum 



frnin and maximum fmax values: 

fmin = min {fi}, 

h<l<n — h 

fmax = max {n}, 

h<l<n — h 

where h is the positive integer denoting the minimum number of observations 
used to estimate the variance at the beginning and at the end of the sample. 
Then we calculate 

f = max{f„^„,f™ax}■ 
The critical point I is the one at which f occurs. 

The complete description of procedure for detecting and adjusting the time 
series with two- regimes structure of the type ([2]) is presented in [24] . Because 
the presented methodology is based on the assumption of normal distribution, 
that is a main disadvantage, therefore in the next Section we introduce the 
innovative procedure of estimation for the critical point that does not require 
any assumption of the distribution. This procedure is based on the behavior of 
empirical second moment of examined time series and is compatible with two 
visual pre-tests for two-regimes structure. 



4 Visual pre-tests for regime variance 

In the first part of this Section we present two visual pre-tests that can confirm 
information if the observed time series Xi,X2,...,X„ constitutes sample that 
satisfy relation ([1]). Those two pre-tests are based on the behavior of empirical 
second moment of the data. In the first method we propose to consider the 
following statistics: 

i 
C,=Y,Xl j = l,2,...,n. (4) 

1=1 

If the random variables X and Y given in relation ([T]) have distributions with 
finite second moments (t\ and it|, respectively , then the statistics Cj has the 
following property: 

^(^^■) = { 7al+l{al-al) Z]l I. ^^^ 

If al = a2, then the mean of Cj statistics is equal to afj for all j = 1, 2, ..., n, 
therefore for i.i.d. sample expected value of the statistics is a linear function 
with the shift parameter equal to zero. Of course this relation is not satisfied for 
distributions with infinite variances, but even in this cases we observe significant 
changes in behavior of Cj statistics. Results of this pre-test we present in Figure 
[3] for different distributions of random variables X and Y in relation (IIJ . We 
consider two cases each consisting of three distributions, namely pure Gaussian 



(A/'(/x,ct)), pure Levy-stable {S{a,f3,<7,fi)), and Gaussian-Levy-stable. In the 
first scenario we consider the case when the parameters of distributions are 
close to each other and thus the structure change point is not well visible in the 
visual pre-test (see left panel of Figure [3]). In the second scenario we consider 
distributions with very different parameters, thus the critical point is observable 
in the simulated sample (the right panel of Figure |31)- 
For the first scenario we consider the following cases: 

• the pure Gaussian case with A/'(0,4) and A/'(0,4.55) distribution for first 
800 and last 1000 observations, respectively, 

• the pure Levy-stable case with 5(1.9, 0, 2, 0) and 5(1.9, 0, 2.5, 0) distribu- 
tion for first 800 and last 1000 observations, respectively, 

• the Levy-stable-Gaussian case with 5(1.8, 0, 1.2, 0) and A/'(0, 2.45) distri- 
bution for first 800 and last 1000 observations, respectively. 

In the second scenario we consider following parameters of distributions: 

• the pure Gaussian case with J\f{Q, 2) and Af{0, 4) distribution for first 800 
and last 1000 observations, respectively, 

• the pure Levy-stable case with 5(1.9, 0, 2, 0) and 5(1.9, 0, 4, 0) distribution 
for first 800 and last 1000 observations, respectively, 

• the Gaussian-Levy-stable case with A/'(0, 4) and 5(1.9, 0, 1, 0) distribution 
for first 800 and last 1000 observations, respectively. 

In the second visual pre-test we observe behavior of the empirical second moment 
of the data from windows of width fc > 0. The examined statistics has the 
following form: 

j+k 
i?,-fc= J2 Xl j=0,l,....n-fc, (6) 

i=3 + l 

where A: is a given positive number called window width. We assume k < I. 
For finite variance distributions of X and Y we can also calculate the expected 
value of Rj^k statistic, namely: 
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ior j + k <l 
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-aj) + l{aj- 


- cr|) + /serf for j + 1 <l < j + k 


kal 




for j + 1 > Z, 



EiRj,k) - < jVI - af) + l{ai - oi) + kai foTJ + l<l<j + k (7) 

[ fco-| for j + 1 > I, 

where a^ and cr| are the second moments of the random variables X and Y 
respectively. As we observe in ([7]), the mean of Rj^k statistics for given window 
width is constant when j<l~korj>l — 1. For l~k<j<l— I the 
statistics has mean that is a linear function with respect to j. When X and Y 
have the same distributions, then expected value of the statistics defined in ([6]) 
is constant for given k. The results of this pre-test are presented in Figure |4] for 
two considered scenarios with different distributions presented above. 
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Figure 3: The Cj statistics defined in (j4]) for two considered scenarios. On the 
left panel we demonstrate results for cases when the parameters of distributions 
are close to each other (the first scenario). The right panel presents cases of 
distributions with very different parameters (the second scenario). 



4.1 Estimation procedure for the critical point 

In this part we introduce the innovative method of estimating the critical point 
of change the statistical properties in the sample that fulfills relation ([T]) . The 
idea of estimation procedure comes from the first visual pre-test described above. 
More precisely, we use the statistics Cj, j — 1,2, ... ,n defined in @ and its 
mean function E{Cj) given in ([5]). 

The algorithm starts with dividing for fixed k — 1,2, . . . ,n the Cj statistics 
into two sets {Cj : j = 1,2, . . . ,k} and {Cj : j = k + l,k + 2, . . . ,n} . Next, we 
fit the linear regression lines yj{k) := ai{k)j + bi{k) and 2/|(fc) := a2{k)j + 
62 (fc) to the first and the second set respectively. From ordinary regression 
theory for such lines the sums of distance squares J2i=ii^j ~ 2/K^))^ ^'^d 
^"■^^. , j^ (Cj — j/^(fc))^ are minimized and therefore the line coefficients have 
the form, see j9]: 



ai 



(fc) = 



E-=ijQ 



(fc+1) 



E^^iQ 



ifc(fc + l)2 + ifc(fc + l)(2fc+l)' 



bi{k) = 



l(2fc + l)E,tiQ-E-=ijQ 

-ifc(fc + l) + ifc(2fc+l) 



The coefficients 02 (/c), 62 (^) have analogous formulas with summation from 
j = k + 1 to n. Our estimator of the point I in relation ([T|) we define as the 
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Figure 4: The i?j,ioo statistics defined in (JH) for two considered scenarios. On tfie 
left panel we demonstrate results for cases when the parameters of distributions 
are close to each other (the first scenario). The right panel presents cases of 
distributions with very different parameters (the second scenario). 



number k that minimalizes mentioned sums of distance squares: 

k n 



I = arg min 

l<fe<ri 



J = l J=k+1 



(8) 



Let us stress that the proposed estimator I is invariant with respect to sample 
distribution. 

We compare the robustness of detecting the critical point of the underlying 
sample satisfying relation ([T]) with the method proposed in jj^l and based on 
variance ratio statistics given in ([3|) . Let us remind that variance ratio statistics 
is intended to detect change point under the assumption of normal distribution 
of the examined series. The procedure is the following: we simulate 1000 trajec- 
tories of length n = 1800 of stochastically independent random variables with 
the change point placed on 800 observation. Similar as for visual pre-tests, we 
consider two cases each consisting of three distributions. Details of examined 
scenarios are presented above. 

The results for the first scenario, where the critical point is not well visible 
are presented in Figure [5l where the first boxplot denotes results of f; estimator 
presented in ([3|) while the second - is related to I estimator defined in ([8]). One 



clearly sees that detection of the critical point based on the I is far more accurate 
than based on n even in case of Gaussian distribution (see panel a in Figure [5]) • 
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Figure 5: Comparison of detection procedure for the critical variance change 
point for two estimators n and /. Panel a: 7V(0,4), A/'(0,4.55), panel b: 
5(1.9, 0, 2, 0), 5(1.9, 0, 2.5, 0), panel c: 5(1.8, 0, 1.2, 0), A/'(0, 2.45). 

The results for the second scenario with clear critical point are presented in 
Figure IHl Also in this case one can see that I estimator performs better than fi . 
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Figure 6: Comparison of detection procedure for the critical variance change 
point for two estimators h and I. Panel a: A/'(0,2), A/'(0,4), panel b: 
5(1.9, 0, 2, 0), 5(1.9, 0, 4, 0), panel c: A/'(0, 4), 5(1.9, 0, 1, 0). 
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5 Statistical test for regime variance 

In this Section we introduce the regime variance test that confirms our assump- 
tion of two- regimes behavior given in relation ([1]). It confirms also the prelim- 
inary results obtained by using the visual pre-tests presented in the previous 
Section. 

The procedure is based on the analysis of the empirical second moment of 
given sample. Let us point that it can be used for distributions with theoretical 
second moment but also for this with infinite variance. Even in this case the 
theoretical second moment exists. Moreover the test is based on the quantilcs 
that without assumption of the distribution we can determine on the basis of 
the empirical distribution function. 

The Ho hypothesis we define as follows: observed time series does not satisfy 
relation ([T|) , that means the quantiles of the squared series do not change in time. 
The hypothesis is satisfied in case of i.i.d. random variables but also in case 
when distributions of two parts (divided by point /) are different but quantiles 
(la/2 and qi-a/2 of squared data are on the same level. 

The Hi hypothesis we formulate as: observed time series has at least rep- 
resentation ([1]), i.e. there are at least two regimes of the data for which the 
appropriate quantiles of squared time series are different. Let us point that 
the Ho hypothesis will be rejected when the squared series has more than two 
regimes. 

The testing of regime variance is based on the assumption the real data 
constitute sample of independent variables therefore before testing we have to 
confirm that given sample constitutes independent data. We propose here to 
use the simple visual method based on the autocorrelation function (ACF) . For 
independent sample the ACF is close to zero for all lags greater than zero. More 
information and basic properties of this methodology one can find in |19) . 

The procedure of regime variance testing for given time series Xi, X2, ..., X„ 
proceeds as follows: 

1. Determine the critical point I according to the procedure presented in 
Section |4l Let us emphasize that also under Ho hypothesis, the I point 
exists and is between 1 and n. 

2. Divide the squared time series into two vectors: Wi = [Xf, ...,Xf] and 
W2 = [A'j^+i, ••■, X^]. Find empirical standard deviations &i and &2 of Wi 
and W2, respectively. For simplicity let us assume that ai < a2- Let us 
point that in case of distribution without theoretical second moment, the 
empirical standard deviation exists and can be calculated on the basis of 
the observed data. 

3. Construct quantiles from the distribution of squared time series from the 
vector Wi (for that the empirical standard deviation was smaller), i.e. 
numbers qa/2 and qi-a/2 that satisfy the relation 

Piqa/2 < Xf < qi-a/2) = 1-a, for each i = 1,2,...,/, 
11 



where a is a given confidence level. Under the Ho hypothesis without the 
assumption of the distribution, the appropriate quantiles we can determine 
on the basis on the empirical cumulative distribution function. Because 
^i+iy^2' ■■■T-^n ^I'S independent therefore the statistics B has Binomial 
distribution with parameters n — I and p = 1 — a. Therefore the p- value 
of the test we calculate as P{Z < B), where Z has Binomial distribution 
with {n ~ l,p) parameters. 

4. If the calculated p- value is greater than the a parameter, then we accept 
the Hq hypothesis. Otherwise if the calculated p-value is smaller than the 
a parameter, then we reject the Hq hypothesis and accept "Hi. 

The complementary part of this Section is the simulation examination of 
the proposed estimator ([5]) and variance regime test described above. First we 
check the committed error of the first order for our test, i.e. the rejecting a true 
Ho hypothesis. For this purpose we generate 1000 trajectories of length 1800 of 
stochastically independent random variables for each of three cases: 

• the Gaussian case with J\f{0, 2) distribution, 

• the Levy-stable case with 5(1.8, 0, 1, 0) distribution, 

• the Gaussian-Levy-Stable case with J\f{0, 1) and 5(1.9, 0, 1, 0) distribution 
for each half of the sample, randomly permuted. 

In our simulations we always assume the significance level a — 0.05 and the 
unknown distribution of samples. Therefore in the testing procedure the em- 
pirical quantiles are applied. We note that the first two cases (pure Gaussian 
and Levy-stable) are the special simplified versions of Ho, i.e. i.i.d. data. 
Obviously the constancy of theoretical quantiles qa/2 E^nd qi-a/2 implies the 
closeness between the empirical versions computed in testing algorithm. The 
Gaussian-Levy-stable case concerns two different distributions of data changing 
dynamically (randomly permuted) on the time domain. Therefore it is contrast 
to the Til hypothesis where different distributions are concentrated in two dis- 
joint time intervals. 

The results of conducted simulations we present in Table [TJ For testing 
procedure we apply the sample mean value of obtained estimators / from each 
generated sample. That mean value of I is 881.37, 916.34 and 104.31 for each 
of three considered cases, respectively. They are close to the half of the sample 
length, which is quite intuitive for data satisfying Ho- We see that extremely 
more times the test correctly does not reject the true null hypothesis Ho and 
the error of the first order is strongly rare, see the Hi-column. Moreover, the p- 
values corresponding to the acceptance of Ho are rightly higher than significance 
level a = 0.05, see Figure [T] The column of Table [T] with p-value contains the 
mean of such p- values. Moreover in the Ho~column and "Hi-column we present 
the numbers of correct accepting and incorrect rejecting Ho, respectively. 
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Figure 7: The boxplots of p-values corresponding to the correct accepting of 
true Ho : 1) A/'(0, 2), 2) 5(1.8, 0, 1, 0), 3) permuted 7V(0, 1), 5(1.9, 0, 1, 0). 



Our next task is to explore the statistical power of the examined test. This 
is equivalent issue to investigation of committing the error of the second order, 
i.e. accepting a false TLq hypothesis. In order to calculate the error of the second 
order, we simulate 1000 trajectories of length 1800 of stochastically independent 
random variables for each of three cases from the first scenario described in 
Section m satisfying the Hi hypothesis. 

In all three cases the differences of distribution parameters are quite small 
and the Hi hypothesis statement can be invisible from the data or its squares, 
see Figure HI This means that we check the efficiency of proposed test in a very 
sophisticated cases. 

We apply the estimator ([8|) and adopt the regime variance test assuming the 
unknown data distribution. The results of conducted simulations with signifi- 
cant level a = 0.05 we present in Table [2] For testing procedure we apply the 
sample mean value of obtained estimators I from each generated sample. That 
mean value of I is 822.28, 943.72 and 646.42 for each of three considered cases 
from the first scenario, respectively. We see that more times the test correctly 
reject the false null hypothesis Ho and the error of the second order is rare, 
see the ?^o~column. The worst result we obtain in the third case with differ- 
ent distributions. However the p-values corresponding to the rejection of Ho 



Distribution of samples 


Ho 


p- value 


Hi 


AA(0,2) 


866 


0.5623 


134 


5(1.8,0,1,0) 


865 


0.5349 


135 


A/'(0, 2), 5(1.9, 0,1,0) 


889 


0.5621 


111 



Table 1: Numbers of correct accepting (with mean p- value) and incorrect re- 
jecting the true Ho- 
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Figure 8: The exemplary samples (left panels) and squared samples (right pan- 
els) for three considered cases of Hi : 1) A/'(0,4), 7V(0,4.55), 2) 5(1.9,0,2,0), 
5(1.9,0,2.5,0), 3) 5(1.8,0, 1.2,0), 7V(0, 2.45). 

are rightly lower than significance level a = 0.05, see Figure [9] The column 
of Table [2] with p-value contains the mean of such p- values. Moreover in the 
Ho-column and 'Hi -column we present the numbers of incorrect accepting T-Cq 
and correct accepting Hi (the power of the test), respectively. We also strongly 
stress that from the construction of the studied test the rejection of Tio hypoth- 
esis is equivalent to the acceptance of TLi. In other words, the rejection of Tio is 
only possible when Hi is true or in the case of first order error. 



Distribution of samples 


Hi 


p-value 


Ho 


A/'(0,4), A/'(0,4.55) 


759 


0.0061 


241 


5(1.9,0,2,0), 5(1.9,0,2.5,0) 


758 


0.0054 


242 


5(1.8,0, 1.2, 0),A/'(0, 2.45) 


652 


0.0044 


348 



Table 2: Numbers of correct rejecting (with mean p-value) and incorrect ac- 
cepting the false Ho- 



6 Plasma data analysis 

In this Section we analyze the real data sets presented in Figure [T] and [2] by 
using the tests for regime variance described in Sections 2] and [5j In Figure [TO] 
we demonstrate results of the visual pre-tests for increments of floating potential 
fluctuations of turbulent laboratory plasma for the small torus radial position 
r = 37 cm (corresponding to Figure [IJ. 

As we observe, the visual pre-tests indicate at the behavior formulated in 
([1]). Moreover we can also determine the critical point I, that divides the time 
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Figure 9: The boxplots of p- values corresponding to the correct rejecting of false 
Ho : 1) A/'(0,4), A/'(0,4.55), 2) 5(1.9,0,2,0), 5(1.9,0,2.5,0), 3) 5(1.8,0,1.2,0), 
A/'(0,2.45). 



series into two independent samples with appropriate statistical properties that 
do not change in time. We estimate the point by using the procedure described 
in Section |4] and get 1055. In the next step of our analysis we test the Hq, i.e. 
the hypothesis that the squared time series has quantiles that do not change 
in time. According to the procedure presented in Section [5] first we confirm 
independence by using ACF function, see Figure 1111 

The regime variance test confirms that the examined data set has at least 
two regimes, i.e. it has representation ([1]). This is related to the fact that with 
confidence level a = 0.05, the obtained p-value is equal to 0.0425 (we reject 
Tio). Because we have estimated the critical point I, that divides the analyzed 
time series into two parts, therefore we can examine if the separate vectors can 
be considered as independent samples with the same appropriate quantiles of 
squared data. In order to do this, we use the regime variance test once again 
for samples Xi, ..., ATioss and Xiose, ...Xigoo- For the first time series, the test 
returns p-value on the level 0.5967, that indicates we can assume the squared 
data have appropriate quantiles that do not change in time. If we test the second 
part of the data set, namely observations from 1056 to 1900, we get p-value 
equal to 0.9829, therefore also for this vector we can conclude that appropriate 
statistical properties do not change. Moreover if we assume the data from two 
considered parts constitute i.i.d. samples (that is one of the possibility when Ho 
hypothesis is satisfied), we can test the distributions. By using tests based on 
the empirical cumulative distribution function completely described in [3], we 
conclude the observations Xi, ..., X1055 come from Levy-stable distribution with 
stable parameter equal to 1.76 and a = 12.14, while the data Xiose, •.-, ATigoo 
have Levy-stable distribution with parameters a = 1.91 and a — 4.2. For both 
samples we use the McCuUoch's estimation method, 1161. 

As we have mentioned in Section [51 the testing procedure we can use also 
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Figure 10: The visual pre-tests for regimes variance of time series that presents 
increments of floating potential fluctuations of turbulent laboratory plasma for 
the small torus radial position r = 37 cm. 




Figure 11: ACF of the time series that presents increments of floating potential 
fluctuations of turbulent laboratory plasma for the small torus radial position 
r = 37 cm. Such behavior of autocorrelation function suggests that the data 
are independent. 



for data for which the critical point I is not so visible as in the previous case, 
see Figure [2] In Figure [12] we present results of the visual pre-tests described 
in Section m for data that describes increments of floating potential fluctuations 
of turbulent laboratory plasma for the small torus radial position r = 36 cm. 
As we observe, on the basis of the behavior of Cj and Rj^k statistics defined in 
(HI) and ([U respectively we can not conclude that the data set exhibits behavior 
described in ([T]). But the procedure of estimating the critical point returns 1763. 
According to the scheme of regime variance testing presented in Section [SJ in 
the first step we confirm independence of the time series. The plot of ACF is 
presented in Figure [T^ 

Next we can test if the hypothesis 'Kq is satisfied for time series presented in 
FigureEJ The obtained p- value equal to 0.0011 indicates the data has at lest two 
regimes with different statistical properties. Similar, as for the first data set. 
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Figure 12: The visual pre-tests for regimes variance of time series that presents 
increments of floating potential fluctuations of turbulent laboratory plasma for 
the small torus radial position r = 36 cm. 




Figure 13: ACF of the time series that presents increments of floating potential 
fluctuations of turbulent laboratory plasma for the small torus radial position 
r = 36 cm. Such behavior of autocorrelation function suggests that the data 
are independent. 



we divide the time series into two separate vectors and test if we can consider 
them as samples for which the characteristics do not change with respect to 
time. For the first part, namely data from 1 to 1763 we get p-value on the 
level 0.593, while for the second vector (i.e. observations form 1764 to 3000) 
the p-vale is equal to 0.591. These results indicate that two considered vectors 
do not satisfy relation ([ij and can be considered as i.i.d. samples. Under this 
assumption we test the distributions and obtain that the two considered parts 
come from Levy-stable distribution. For the first vector we obtain following 
estimates of the parameters: a — 1.9484 and a = 12.9505, while estimated 
values of parameters for the vector containing observations X^q^, ...,^3000 are: 
a = 1.7983 and a = 14.1099. 
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Conclusions 

This paper is devoted to analysis of time series that exhibit two-regimes be- 
havior. We have introduced the new estimation procedure for recognition the 
critical point that divides the observed time series into two regimes with differ- 
ent statistical properties expressed in the language of the quantiles for squared 
data (Sectional). We have developed also three tests that can confirm our as- 
sumption of two-regimes behavior (Sections HI and [5]) . The universality of the 
presented methodology comes from the fact that it does not assume the distri- 
bution of examined time series therefore it can by applied to rich class of real 
data sets. The theoretical results we have illustrated by using the simulated 
time series and analysis of two real data sets related to turbulent laboratory 
plasma. 
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